perm filename PAPER.DOC[4,KMC] blob
sn#061934 filedate 1973-08-31 generic text, type T, neo UTF8
00100 ␈↓ ↓-IDIOLECTIC LANGUAGE-ANALYSIS FOR
00200 ␈↓␈↓ mUNDERSTANDING DOCTOR-PATIENT DIALOGUES*
00300
00400 ␈↓ ↓>Horace Enea and Kenneth Mark Colby
00500
00600 ␈↓ ↓↑Department of Computer Science
00700 ␈↓ αAStanford University
00800 ␈↓ αDStanford, California
00900
01000
01100
01200
01300
01400
01500
01600
01700
01800
01900
02000
02100
02200
02300
02400
02500
02600
02700
02800
02900 ␈↓---------------------------------------------------------
03000 ␈↓*␈αThis␈αresearch␈αis␈αsupported␈αby␈αGrant␈αPHS␈αMH␈α06645-12␈αfrom
03100 the␈α⊂National␈α⊃Institute␈α⊂of␈α⊂Mental␈α⊃Health,␈α⊂by␈α⊂(in␈α⊃part)␈α⊂Research
03200 Scientist␈α≤Award␈α≤(No.␈α≤ 1-K05-K14,433)␈α≤from␈α≤the␈α≤National
03300 Institute␈α⊃of␈α⊂Mental␈α⊃Health␈α⊂to␈α⊃the␈α⊂second␈α⊃author␈α⊂and␈α⊃(in␈α⊂part)
03400 by␈α⊃the␈α⊃Advanced␈α⊃Research␈α∩Projects␈α⊃Agency␈α⊃of␈α⊃the␈α∩Office␈α⊃of
03500 the␈αSecretary␈αof␈αDefense(SD-183).
03600 ␈↓---------------------------------------------------------
03700
03800 ␈↓␈↓ α\␈↓
03900 ABSTRACT␈↓
04000
04100 ␈↓ A␈α∂programming␈α∂language␈α⊂is␈α∂described␈α∂which␈α∂is␈α⊂designed␈α∂to
04200 simplify␈α↔the␈α⊗construction␈α↔of␈α⊗computer␈α↔programs␈α↔to␈α⊗analyze
04300 English.␈α⊃This␈α⊃system␈α⊂attempts␈α⊃to␈α⊃merge␈α⊂the␈α⊃best␈α⊃features␈α⊂of
04400 pattern␈α≤matchers␈α≤and␈α≥the␈α≤phrase␈α≤structure␈α≥approach␈α≤to
04500 languagα⊂analysisα⊃ Severaα⊂practicaα⊃problemα⊂whicα⊃occuα⊂in
04600 dealing␈αwith␈αsuch␈αa␈αsystem␈αare␈αdescribed.
04700
04800 ␈↓ α)␈↓
04900 INTRODUCTION␈↓
05000
05100 ␈↓ Why␈α∃is␈α∃it␈α∃so␈α∃difficult␈α∃for␈α∃machines␈α∃to␈α∃understand
05200 natural␈α→language?␈α_Perhaps␈α→it␈α→is␈α_because␈α→machines␈α→do␈α_not
05300 simulate␈α∃sufficiently␈α∀what␈α∃humans␈α∀do␈α∃when␈α∃humans␈α∀process
05400 language.␈α∞ Several␈αyears␈α∞of␈αexperience␈α∞with␈α∞computer␈αscience
05500 and␈α~linguistic␈α≠approaches␈α~have␈α~taught␈α≠us␈α~the␈α≠scope␈α~and
05600 limitations␈αof␈αsyntactic␈αand␈αsemantic␈αparsers.(Schank,Tesler␈αand
05700 Weber,␈↓↓8␈↓␈α Simmons,␈↓↓9␈↓␈α Winograd,␈↓↓13␈↓␈α Woods␈↓↓14␈↓).␈α!While␈α extant
05800 linguistic␈α∪parsers␈α∩perform␈α∪satisfactorily␈α∩with␈α∪carefully␈α∩edited
05900 text␈α∂sentences␈α∞or␈α∂with␈α∂small␈α∞dictionaries␈α∂,␈α∞they␈α∂are␈α∂unable␈α∞to
06000 deal␈α∂with␈α∂everyday␈α∞language␈α∂behavior␈α∂characteristic␈α∂of␈α∞human
06100 conversation.␈α≡ In␈α≡a␈α∨rationalistic␈α≡quest␈α≡for␈α∨certainty␈α≡and
00100 attracted␈α⊂by␈α⊂an␈α⊂analogy␈α⊂from␈α∂the␈α⊂proof␈α⊂theory␈α⊂of␈α⊂logicians␈α∂in
00200 ␈↓which␈α∂provability␈α∞implied␈α∂computability,␈α∂computational␈α∞linguists
00300 hoped␈α∪to␈α∩develop␈α∪formalisms␈α∩for␈α∪natural␈α∪language␈α∩grammars.
00400 But␈α⊂the␈α⊃hope␈α⊂has␈α⊃not␈α⊂been␈α⊂realized␈α⊃and␈α⊂perhaps␈α⊃in␈α⊂principle
00500 cannot␈α∀be.␈α∀ (It␈α∀is␈α∀difficult␈α∀to␈α∀formalize␈α∀something␈α∀which␈α∪can
00600 hardly␈αbe␈αformulated).
00700
00800 Linguistic␈α~parsers␈α~use␈α~morphographemic␈α~analyses,
00900 parts-of-speech␈α'assignments␈α'and␈α'dictionaries␈α'containing
01000 multiple␈α≥word-senses␈α≡each␈α≥possessing␈α≡semantic␈α≥features,
01100 programs␈α⊗or␈α⊗rules␈α⊗for␈α⊗restricting␈α⊗word␈α↔combinations.␈α⊗ Such
01200 parsers␈α∩perform␈α⊃a␈α∩detailed␈α⊃analysis␈α∩of␈α⊃every␈α∩word,␈α⊃valiantly
01300 disambiguating␈α↔at␈α↔each␈α↔step␈α↔in␈α↔an␈α↔attempt␈α↔to␈α↔construct␈α⊗a
01400 meaningful␈α≤interpretation.␈α≤ While␈α≤it␈α≤may␈α≤be␈α≠sophisticated
01500 computationally,␈α∪a␈α∀conventional␈α∪parser␈α∀is␈α∪quite␈α∀at␈α∪a␈α∀loss␈α∪to
01600 deal␈α∩with␈α⊃the␈α∩caprice␈α∩of␈α⊃ordinary␈α∩conversation.␈α∩ In␈α⊃everyday
01700 discourse␈α⊂people␈α⊂speak␈α⊃colloquially␈α⊂and␈α⊂idiomatically␈α⊃using␈α⊂all
01800 sorts␈α_of␈α_pat␈α_phrases,␈α_ slang␈α_and␈α_cliches.␈α_ The␈α_number␈α_of
01900 special-case␈α↔expressions␈α_is␈α↔indefinitely␈α↔large.␈α_ Humans␈α↔are
02000 cryptic␈α∞and␈α∂elliptic.␈α∞ They␈α∞lard␈α∂even␈α∞their␈α∂written␈α∞expressions
02100 with␈α≤meaningless␈α≤fillers␈α≤and␈α≤fragments.They␈α≤convey␈α≠their
02200 intentions␈α∀and␈α∀ideas␈α∀in␈α∀idiosyncratic␈α∀and␈α∃metaphorical␈α∀ways,
02300 blithely␈αviolating␈α∞rules␈αof␈α'correct'␈α∞grammar␈αand␈α∞syntax.␈α Given
02400 these␈αdifficulties,␈α∞how␈αis␈α∞it␈αthat␈α∞people␈αcarry␈α∞on␈αconversations
02500 easily␈α∂most␈α∞of␈α∂the␈α∞time␈α∂while␈α∞machines␈α∂thus␈α∞far␈α∂have␈α∂found␈α∞it
02600 extremely␈α∃difficult␈α∃to␈α∀continue␈α∃to␈α∃make␈α∃appropriate␈α∀replies
02700 indicating␈αsome␈αdegree␈αof␈αunderstanding?
02800
02900 It␈α_seems␈α→that␈α_people␈α→`get␈α_the␈α→message'␈α_without
03000 always␈α∩analyzing␈α∩every␈α∩single␈α∩word␈α∩in␈α∩the␈α∩input.␈α∩They␈α∩even
03100 ignore␈α_some␈α↔of␈α_its␈α_terms.␈α↔People␈α_make␈α_individualistic␈α↔and
03200 idiosyncratic␈α⊃selections␈α⊃from␈α⊃highly␈α⊃redundant␈α⊃and␈α⊂repetitious
03300 communications.␈α⊃ These␈α⊃personal␈α⊃selective␈α⊃operations,␈α⊃ based
03400 on␈α∃idiosyncratic␈α∃intentions,␈α∃produce␈α∃a␈α∃transformation␈α∃of␈α∃the
03500 input␈α∂by␈α∂destroying␈α∂and␈α∂even␈α∂distorting␈α∂information.␈α∂ In␈α∞speed
03600 reading,␈α∪for␈α∪example,␈α∪only␈α∪a␈α∪small␈α∪percentage␈α∪of␈α∪contentive
03700 words␈α≠on␈α≠each␈α≠page␈α~need␈α≠be␈α≠looked␈α≠at.␈α≠ These␈α~words
03800 somehow␈α→resonate␈α~with␈α→the␈α→readers␈α~relevant␈α→conceptual-
03900 belief␈α⊃structure␈α⊃whose␈α∩processes␈α⊃enable␈α⊃him␈α∩to␈α⊃`understand'
04000 not␈α⊂simply␈α⊂the␈α⊂language␈α⊂but␈α⊂all␈α⊂sorts␈α⊂of␈α⊂unmentioned␈α∂aspects
04100 about␈α↔the␈α↔situations␈α↔and␈α↔events␈α↔being␈α↔referred␈α↔to␈α_in␈α↔the
04200 language.␈α∂ Normal␈α∞written␈α∂English␈α∞text␈α∂is␈α∞estimated␈α∂to␈α∂be␈α∞5/6
04300 redundant␈α9 (Rubenstein␈α8 and␈α9 Haberstroh␈↓↓7␈↓).␈α8Spoken
04400 conversations␈α∨in␈α∨English␈α∨are␈α∨probably␈α∨better␈α∨than␈α≡50%
04500 redundant(Carroll␈↓↓1␈↓).␈α≡Words␈α≡can␈α≡be␈α≡garbled␈α≡and␈α≥listeners
04600 nonetheless␈α∂get␈α∂the␈α⊂gist␈α∂or␈α∂drift␈α∂of␈α⊂what␈α∂is␈α∂being␈α⊂said.␈α∂ They
04700 see␈αthe␈α"pattern"␈αand␈αthus␈αcan␈αsupply␈αmuch␈αof␈αwhat␈αis␈αmissing.
04800
04900 To␈α"approximate␈α!such␈α"human␈α"achievements␈α!we
05000 require␈α_a␈α_new␈α_perspective␈α_and␈α_a␈α_practical␈α_method␈α_which
05100 differs␈α≥from␈α≥that␈α≡of␈α≥current␈α≥linguistic␈α≡approaches.␈α≥ This
05200 alternate␈α∨approach␈α∨should␈α∨incorporate␈α∨those␈α∨aspects␈α∨of
05300 parsers␈α⊃which␈α⊃have␈α⊃been␈α∩found␈α⊃to␈α⊃work␈α⊃well,␈α∩e.g.,␈α⊃detecting
05400 embedded␈α⊂clauses.␈α⊂ Also␈α⊂individualistic␈α⊂features␈α∂characteristic
05500 of␈α an␈α∨idiolect␈α should␈α∨have␈α dominant␈α emphasis.␈α∨Parsers
05600 represent␈α∞complex␈α∞and␈α∞refined␈α∞algorithms.␈α∞ While␈α∞on␈α∞one␈αhand
05700 they␈α≡subject␈α≡a␈α∨sentence␈α≡to␈α≡a␈α≡detailed␈α∨and␈α≡sometimes
05800 overkilling␈α≡analysis,␈α≥on␈α≡the␈α≥other,␈α≡they␈α≥are␈α≡finicky␈α≥and
05900 oversensitive.␈α⊗ A␈α⊗conventional␈α⊗parser␈α⊗may␈α⊗simply␈α⊗halt␈α⊗if␈α⊗a
00100 word␈α∂in␈α⊂the␈α∂input␈α∂sentence␈α⊂is␈α∂not␈α∂present␈α⊂in␈α∂its␈α⊂dictionary.␈α∂It
00200 ␈↓finds␈α∪ungrammatical␈α∩expressions␈α∪such␈α∩as␈α∪double␈α∩prepositions
00300 (`Do␈α~you␈α~want␈α~to␈α≠get␈α~out␈α~of␈α~from␈α~the␈α≠hospital?')␈α~quite
00400 confusing.␈α_ Parsers␈α→constitute␈α_a␈α→tight␈α_conjunction␈α→of␈α_tests
00500 rather␈α⊂than␈α∂a␈α⊂loose␈α⊂disjunction.␈α∂ As␈α⊂more␈α∂and␈α⊂more␈α⊂tests␈α∂are
00600 added␈α∂to␈α∞the␈α∂conjunction,␈α∂the␈α∞parser␈α∂behaves␈α∞like␈α∂a␈α∂finer␈α∞and
00700 finer␈α≡filter␈α≡which␈α∨makes␈α≡it␈α≡increasingly␈α≡difficult␈α∨for␈α≡an
00800 expression␈α⊂to␈α∂pass␈α⊂through.␈α∂ Parsers␈α⊂do␈α∂ not␈α⊂ allow␈α⊂ for␈α∂ the
00900 exclusions␈αtypical␈αof␈αeveryday␈αhuman␈αdialogues.
01000
01100 Finally,␈α∂it␈α∂is␈α⊂difficult␈α∂to␈α∂keep␈α∂consistent␈α⊂a␈α∂dictionary
01200 of␈α≡over␈α≡500␈α≡multiple-sense␈α≡words␈α≡classified␈α≡by␈α≡binary
01300 semantic␈α∂features␈α∂or␈α∂rules.␈α⊂For␈α∂example,␈α∂suppose␈α∂a␈α⊂noun␈α∂(Ni)
01400 is␈α∀used␈α∀by␈α∀some␈α∀verbs␈α∪as␈α∀a␈α∀direct␈α∀object␈α∀in␈α∀the␈α∪semantic
01500 sense␈α⊂of␈α⊃a␈α⊂physical␈α⊃object.␈α⊂Then␈α⊂it␈α⊃is␈α⊂noticed␈α⊃that␈α⊂Ni␈α⊃is␈α⊂also
01600 used␈α∞by␈α∞other␈α∞verbs␈α∞in␈α∞the␈α∞sense␈α∞of␈α∞a␈α∞location␈α∞so␈α∞`location'␈αis
01700 added␈α⊗to␈α⊗Ni's␈α⊗list␈α⊗of␈α⊗semantic␈α⊗features.␈α⊗ Now␈α↔Ni␈α⊗suddenly
01800 qualifies␈αas␈αa␈αdirect␈αobject␈αfor␈αa␈αlot␈αof␈αother␈αverbs.␈α But␈αsome
01900 of␈α_the␈α_resultant␈α_combinations␈α_make␈α_no␈α_sense␈α_even␈α_in␈α_an
02000 idiolect.␈α⊃If␈α⊂a␈α⊃special␈α⊂feature␈α⊃is␈α⊂then␈α⊃created␈α⊂for␈α⊃Ni,␈α⊃then␈α⊂one
02100 loses␈α_the␈α↔power␈α_of␈α_general␈α↔classes␈α_of␈α_semantic␈α↔features.
02200 Adding␈α∞a␈α∞single␈α∞semantic␈α∞feature␈α∞can␈α∞result␈α∞in␈α∞the␈αpropagation
02300 of␈α∃hidden␈α∀inconsistencies␈α∃and␈α∀unwanted␈α∃side-effect..␈α∃as␈α∀the
02400 dictionary␈α∞grows␈α∂it␈α∞becomes␈α∂increasingly␈α∞unstable␈α∂and␈α∞difficult
02500 to␈αcontrol.
02600
02700 Early␈α"attempts␈α!to␈α"develop␈α"a␈α!pattern-matching
02800 approach␈α)using␈α)special-purpose␈α)heuristics␈α*have␈α)been
02900 described␈α↔by␈α⊗Colby,␈α↔Watt␈α⊗and␈α↔Gilbert,␈↓↓2␈↓␈α↔Weizenbaum␈↓↓11␈↓␈α⊗and
03000 Colby␈α∩and␈α∩Enea.␈↓↓3␈↓␈α∩The␈α⊃limitations␈α∩of␈α∩these␈α∩attempts␈α∩are␈α⊃well
03100 known␈α∪to␈α∪workers␈α∪in␈α∪artificial␈α∪intelligence.␈α∪The␈α∪man-machine
03200 conversations␈α∪of␈α∀such␈α∪programs␈α∪soon␈α∀becomes␈α∪impoverished
03300 and␈α∩boring.␈α∩Such␈α⊃primitive␈α∩context-restricted␈α∩programs␈α⊃often
03400 grasp␈α∪a␈α∩topic␈α∪well␈α∪enough␈α∩but␈α∪too␈α∩often␈α∪do␈α∪not␈α∩understand
03500 quite␈α∃what␈α∃is␈α∃being␈α∃said␈α∃about␈α∃the␈α∃topic,␈α∃with␈α∃amusing␈α∃or
03600 disastrous␈αconsequences.␈αThis␈αshortcoming␈αis␈αa␈αconsequence␈αof
03700 the␈α⊂limitations␈α⊂of␈α⊂a␈α⊂pattern-␈α⊂matching␈α⊂approach␈α⊂lacking␈α⊂a␈α⊂rich
03800 conceptual␈α∩structure␈α∩into␈α∪which␈α∩the␈α∩pattern␈α∪abstracted␈α∩from
03900 the␈α∀input␈α∃can␈α∀be␈α∀matched␈α∃for␈α∀inferencing.␈α∃ These␈α∀programs
04000 also␈α∀lack␈α∀a␈α∀subroutine␈α∀structure,␈α∀both␈α∀pattern␈α∀directed␈α∀and
04100 specific,␈αdesirable␈αfor␈αgeneralizations␈αand␈αfurther␈αanalysis.
04200
04300 The␈α⊃strength␈α⊃of␈α⊃these␈α⊃pattern␈α⊃matching␈α⊂approaches
04400 lies␈α∞in␈αtheir␈α∞ability␈αto␈α∞ignore␈αsome␈α∞of␈αthe␈α∞input.␈α They␈α∞look␈αfor
04500 patterns,␈α∃which␈α∃means␈α∃the␈α∀emphasis␈α∃of␈α∃some␈α∃detail␈α∃to␈α∀the
04600 exclusion␈α∞of␈α∞other␈α∞detail.␈α∞ Thus␈α∞they␈α∞can␈α∞get␈α∞something␈α∞out␈α∞of
04700 nearly␈αevery␈αsentence--␈αsometimes␈αmore,␈αsometimes␈αless.
04800
04900 An␈α*interesting␈α)pattern-matching␈α*approach␈α)for
05000 machine␈α≤translation␈α≤has␈α≤been␈α≤developed␈α≤by␈α≥Wilks.␈↓↓12␈↓␈α≤His
05100 program␈α∂constructs␈α∂a␈α∂pattern␈α∞from␈α∂English␈α∂text␈α∂input␈α∂which␈α∞is
05200 matched␈α∪against␈α∪templates␈α∪in␈α∪an␈α∪interlingual␈α∪data␈α∪base␈α∪from
05300 which,in␈α⊗turn,␈α∃ French␈α⊗output␈α⊗is␈α∃generated␈α⊗without␈α⊗using␈α∃a
05400 generative␈αgrammar.
05500
05600 In␈α$the␈α%course␈α$of␈α$constructing␈α%an␈α$interactive
05700 simulation␈α∀of␈α∀paranoia␈α∀we␈α∀were␈α∀faced␈α∀with␈α∀the␈α∀problem␈α∀of
05800 dealing␈αwith␈αunedited␈αand␈αunrestricted␈αnatural␈αlanguage␈αas␈αit␈αis
05900 used␈α$in␈α$the␈α#doctor-patient␈α$situation␈α$of␈α$a␈α#psychiatric
00100 interview.(Colby,␈α⊃Hilf,␈α⊃Weber,␈α⊃and␈α⊃Kraemer,␈↓↓4␈↓␈α⊃Colby␈α⊃and␈α⊂Hilf␈↓↓5␈↓).
00200 ␈↓This␈α+domain␈α+of␈α+discourse␈α+admittedly␈α,contains␈α+many
00300 psychiatrically␈α∩stereotyped␈α⊃expressions␈α∩and␈α⊃is␈α∩constrained␈α⊃in
00400 topics␈α∃(Newton`s␈α∃laws␈α∃are␈α∃rarely␈α∃discussed).␈α∃But␈α∃it␈α⊗is␈α∃rich
00500 enough␈α∪in␈α∪verbal␈α∀behavior␈α∪to␈α∪be␈α∀a␈α∪challenge␈α∪to␈α∀a␈α∪language
00600 understanding␈α∂algorithm␈α∂since␈α∂a␈α∂variety␈α∂of␈α∂human␈α∂experiences
00700 are␈αdiscussed␈αdomain␈αincluding␈αthe␈αinterpersonal␈αrelation␈αwhich
00800 develops␈α⊃between␈α⊃the␈α⊃interview␈α⊃participants.␈α⊃ A␈α⊃look␈α∩at␈α⊃the
00900 contents␈α∞of␈α∞a␈α∞thesaurus␈α∞reveals␈α∞that␈α∞words␈α∞relating␈α∞to␈α∞people
01000 and␈αtheir␈αinterrelations␈αmake␈αup␈αroughly␈α70%␈αof␈αEnglish␈αwords.
01100
01200 The␈α∩diagnosis␈α∩of␈α∩paranoia␈α∩is␈α∩made␈α∪by␈α∩psychiatrists
01300 relying␈α_mainly␈α↔on␈α_the␈α_verbal␈α↔behavior␈α_of␈α_the␈α↔interviewed
01400 patient.␈α∞ If␈αa␈α∞paranoid␈α∞model␈αis␈α∞to␈αexhibit␈α∞paranoid␈α∞behavior␈αin
01500 a␈α≠psychiatric␈α≠interview,␈α≠ it␈α≠must␈α≠be␈α≠capable␈α≠of␈α~handling
01600 dialogues␈α⊗typical␈α↔of␈α⊗the␈α↔doctor-patient␈α⊗context.␈α↔ Since␈α⊗the
01700 model␈α∩can␈α∪communicate␈α∩only␈α∩through␈α∪teletyped␈α∩messages,the
01800 vis-a-vis␈α≠aspects␈α≠of␈α≠the␈α≠usual␈α≠psychiatric␈α≤interview␈α≠are
01900 absent.␈α_ Therefore␈α_the␈α_model␈α↔must␈α_be␈α_able␈α_to␈α_deal␈α↔with
02000 unedited␈α⊗typewritten␈α↔natural␈α⊗language␈α↔input␈α⊗and␈α↔to␈α⊗output
02100 replies␈α∂which␈α∂are␈α∂indicative␈α∞of␈α∂an␈α∂underlying␈α∂paranoid␈α∞thought
02200 process␈αduring␈αthe␈αepisode␈αof␈αa␈αpsychiatric␈αinterview.
02300
02400 In␈α≠an␈α≠interview␈α≤there␈α≠is␈α≠always␈α≠a␈α≤who␈α≠saying
02500 something␈αto␈αa␈αwhom␈αwith␈αdefinite␈αintentions␈αand␈αexpectations.
02600 There␈α∩are␈α∩two␈α∪situations␈α∩to␈α∩be␈α∪taken␈α∩into␈α∩account,␈α∪the␈α∩one
02700 being␈α~talked␈α~about␈α~and␈α→the␈α~one␈α~the␈α~participants␈α~are␈α→in.
02800 Sometimes␈α↔the␈α↔latter␈α⊗becomes␈α↔the␈α↔former.␈α↔ Participants␈α⊗in
02900 dialogues␈α∪have␈α∪intentions␈α∀and␈α∪dialogue␈α∪algorithms␈α∀must␈α∪take
03000 this␈α⊃into␈α⊃account.␈α⊂ The␈α⊃doctor's␈α⊃intention␈α⊂is␈α⊃to␈α⊃gather␈α⊂certain
03100 kinds␈α∀of␈α∪information␈α∀while␈α∪the␈α∀patient's␈α∪intention␈α∀is␈α∀to␈α∪give
03200 information␈α∂and␈α∂get␈α∂help.␈α∂ A␈α∂job␈α∞is␈α∂to␈α∂be␈α∂done;␈α∂it␈α∂is␈α∂not␈α∞small
03300 talk.␈α∩ Our␈α∩working␈α∩hypothesis␈α∩is␈α∩that␈α∩each␈α∩participant␈α∩in␈α∩the
03400 dialogue␈α%understands␈α%the␈α%other␈α%by␈α&matching␈α%selected
03500 idiosyncratically-␈α≤significant␈α≤patterns␈α≤in␈α≤the␈α≥input␈α≤against
03600 conceptual␈α≡patterns␈α≥which␈α≡contain␈α≥information␈α≡about␈α≥the
03700 situation␈α$or␈α$event␈α$being␈α$described␈α$linguistically.␈α# This
03800 understanding␈α"is␈α"communicated␈α"reciprocally␈α#by␈α"linguistic
03900 responses␈α(judged␈α)appropriate␈α(to␈α(the␈α)intentions␈α(and
04000 expectations␈α∩of␈α∪the␈α∩participants␈α∩and␈α∪to␈α∩the␈α∪requirements␈α∩of
04100 the␈α∞situation.␈α∂In␈α∞this␈α∂paper␈α∞we␈α∞shall␈α∂describe␈α∞only␈α∂our␈α∞current
04200 input-analyzing␈α_processes␈α_used␈α_to␈α_extract␈α_a␈α_pattern␈α_from
04300 natural␈α↔language␈α↔input.␈α_ In␈α↔a␈α↔later␈α↔communication␈α_we␈α↔shall
04400 describe␈α"the␈α"inferential␈α!processes␈α"carried␈α"out␈α"at␈α!the
04500 conceptual␈αlevel␈αonce␈αa␈αpattern␈αhas␈αbeen␈αreceived␈αby␈αmemory
04600 from␈αthe␈αinput-analysing␈αprocesses.
04700
04800 Studies␈α↔of␈α↔our␈α↔1971␈α↔model␈α↔of␈α_paranoia␈α↔(PARRY)
04900 indicated␈α∞that␈α∞about␈α∞thirty␈α∞percent␈α∞of␈α∞the␈α∞sentences␈α∞were␈αnot
05000 understood␈α∩at␈α∪all␈α∩,␈α∩that␈α∪is,␈α∩no␈α∩concept␈α∪in␈α∩the␈α∪sentence␈α∩was
05100 recognized.␈α_ In␈α_a␈α_somewhat␈α↔larger␈α_number␈α_of␈α_cases␈α↔some
05200 concepts,␈α∂but␈α∂not␈α∂all,␈α⊂ were␈α∂recognized.␈α∂ In␈α∂many␈α⊂cases␈α∂these
05300 partially␈α∂recognized␈α⊂sentences␈α∂lead␈α⊂to␈α∂a␈α⊂partial␈α∂understanding
05400 that␈α∞was␈α∞sufficient␈α∂to␈α∞gather␈α∞the␈α∂intention␈α∞of␈α∞the␈α∂speaker␈α∞and
05500 thus␈α≠lead␈α≠to␈α≠output␈α≠an␈α≠appropriate␈α≠response.␈α~ However,
05600 misunderstandings␈αoccurred␈αtoo␈αoften.␈αFor␈αexample:
00100 ␈↓∞ DOCTOR: How old is your mother ?
00200
00300 PARRY: Twenty-eight
00400
00500 ␈↓PARRY␈α⊃has␈α⊃interpreted␈α⊃the␈α∩question␈α⊃as␈α⊃referring␈α⊃to␈α∩his␈α⊃own
00600 age␈αand␈α∞answered␈αby␈αgiving␈α∞his␈αage.␈α The␈α∞purpose␈αof␈α∞our␈αnew
00700 language␈α⊂analysis␈α⊂system␈α⊂is␈α⊃to␈α⊂significantly␈α⊂raise␈α⊂the␈α⊃level␈α⊂of
00800 understanding␈α∞by␈α∂preventing␈α∞such␈α∞misunderstandings␈α∂while␈α∞not
00900 restricting␈α∪what␈α∩can␈α∪be␈α∪said␈α∩to␈α∪PARRY.␈α∩ We␈α∪do␈α∪not␈α∩expect
01000 complete␈α∃under-␈α∀standing␈α∃from␈α∃this␈α∀system␈α∃--␈α∃even␈α∀native
01100 speakers␈α∪of␈α∪the␈α∪language␈α∩do␈α∪not␈α∪completely␈α∪understand␈α∩the
01200 language.
01300
01400 By␈α`understanding'␈αwe␈αmean␈αthe␈αsystem␈αcan␈αdo␈αsome
01500 or␈αall␈αof␈αthe␈αfollowing:
01600
01700 1)␈α+Determine␈α*the␈α+intention␈α+of␈α*the
01800 interviewer␈α1in␈α0making␈α1a␈α0particular
01900 utterance.
02000
02100 2)␈α_Make␈α_common␈α_logical␈α_deductions␈α_that
02200 follow␈αfrom␈αthe␈αinterviewers␈αutterance
02300
02400 3)␈α∞Form␈α∞an␈α∞idioletic␈α∂internal␈α∞representation
02500 of␈α∂the␈α∂utterance␈α∂so␈α∂that␈α∂questions␈α∂may␈α∂be
02600 answered,␈α∀commands␈α∀carried␈α∀out,␈α∃or␈α∀data
02700 added␈αto␈αmemory.
02800
02900 4)␈α∂Determine␈α∂references␈α∂for␈α∂pronouns,␈α∂and
03000 other␈αanaphora.
03100
03200 5)␈α∃Deduce␈α∀the␈α∃tone␈α∀of␈α∃the␈α∀utterance,i.e.,
03300 hostile,␈αinsulting...
03400
03500 6)␈α!Classify␈α"the␈α!input␈α!as␈α"a␈α!question,
03600 rejoinder,command,␈α...
03700
03800 ␈↓ The␈α∞approach␈α∂we␈α∞are␈α∞taking␈α∂consists␈α∞of␈α∂merging␈α∞the
03900 best␈α⊃features␈α∩of␈α⊃pattern␈α∩directed␈α⊃systems␈α∩such␈α⊃as␈α∩the␈α⊃MAD
04000 DOCTOR,␈↓↓2␈↓␈α⊃ELIZA␈↓↓11␈↓␈α⊃and␈α⊂parsing␈α⊃directed␈α⊃systems␈α⊃for␈α⊂example,
04100 Winograd,␈↓↓13␈↓␈α⊃Woods.␈↓↓14␈↓.␈α⊃ By␈α⊂merging␈α⊃the␈α⊃BNF␈α⊃phrase␈α⊂structure
04200 approach␈α≤ot␈α≤analyzing␈α≤English␈α≤with␈α≤the␈α≤pattern␈α≠matching
04300 approach,␈αwith␈αits␈αattendant␈αemphasis␈αof␈αsome␈αconcepts␈αto␈αthe
04400 exclusion␈α↔of␈α⊗others.␈α↔ The␈α⊗programs␈α↔to␈α⊗accomplish␈α↔this␈α⊗are
04500 written␈α∂in␈α∂MLISP2,␈α⊂ an␈α∂extensible␈α∂version␈α∂of␈α⊂the␈α∂programming
04600 language␈α∃MLISP,␈↓↓6,10␈↓␈α∀and␈α∃uses␈α∃an␈α∀interpreted␈α∃version␈α∃of␈α∀the
04700 pattern␈α⊗matcher␈α∃designed␈α⊗for␈α∃a␈α⊗new␈α⊗programming␈α∃language
04800 LISP70.
04900
05000 The␈α∩following␈α∩is␈α∩a␈α⊃basic␈α∩description␈α∩of␈α∩the␈α⊃pattern
05100 matcher.␈α⊃ We␈α⊃shall␈α⊃illustrate␈α⊃its␈α⊃operation␈α⊃using␈α∩examples␈α⊃of
05200 problems␈αcommon␈αto␈αteletyped␈αpsychiatric␈αdialogues.
05300
05400 ␈↓ ↓j␈↓
05500 PATTERN MATCHING␈↓
05600
05700 ␈↓ Pattern␈α∪directed␈α∀computation␈α∪involves␈α∪two␈α∀kind␈α∪of
05800 operations␈α5on␈α4data␈α5structures:␈α5decomposition␈α4and
05900 recomposition.␈α∀ Decomposition␈α∀breaks␈α∀down␈α∀an␈α∃input␈α∀stream
06000 into␈αcomponents␈αunder␈αthe␈αdirection␈αof␈αa␈αdecompostion␈αpattern
00100 ("dec").␈α∪ The␈α∩inverse␈α∪operation,␈α∩recomposition,␈α∪constructs␈α∩an
00200 ␈↓output␈α⊂stream␈α∂under␈α⊂the␈α⊂direction␈α∂of␈α⊂a␈α⊂recomposition␈α∂pattern
00300 ("rec").
00400
00500 A␈αrewrite␈αrule␈αis␈αof␈αthe␈αform:
00600
00700 ␈↓∞ dec → rec
00800
00900
01000 ␈↓It␈α∞defines␈αa␈α∞partial␈α∞function␈αon␈α∞streams␈α∞as␈αfollows:␈α∞if␈α∞the␈αinput
01100 stream␈α∂matches␈α∂the␈α∞dec,␈α∂then␈α∂the␈α∞output␈α∂stream␈α∂is␈α∞generated
01200 by␈α∃the␈α∀rec.␈α∃The␈α∀following␈α∃rule␈α∀(given␈α∃as␈α∀an␈α∃example␈α∀only)
01300 could␈αbe␈αpart␈αof␈αa␈αquestion␈αanswering␈αfunction:
01400
01500 ␈↓∞ How are you ? → Very well and you ?
01600
01700 ␈↓If␈αthe␈αinput␈αstream␈αconsists␈αof␈αthe␈αfour␈αtokens:
01800
01900 ␈↓∞ How are you ?
02000
02100 ␈↓the␈αoutput␈αstream␈αwill␈αconsist␈αof␈αthe␈αfive␈αtokens:
02200
02300 ␈↓∞ Very well and you ?
02400
02500 ␈↓␈↓
02600 REWRITE FUNCTIONS ␈↓
02700
02800 ␈↓ A␈α∨rewrite␈α≡rule␈α∨defines␈α≡a␈α∨partial␈α∨function,␈α≡for
02900 example,␈α↔the␈α↔mapping␈α↔of␈α↔some␈α↔particular␈α↔token␈α↔into␈α⊗some
03000 other␈α↔particular␈α↔token.␈α↔ A␈α↔broader␈α↔partial␈α↔function␈α↔can␈α⊗be
03100 defined␈α⊗as␈α⊗the␈α⊗union␈α⊗of␈α⊗several␈α⊗rewrite␈α⊗rules.␈α⊗ A␈α∃rewrite
03200 function␈αdefinition␈αis␈αof␈αthe␈αform:
03300
03400 ␈↓∞ RULES OF <name> =
03500 dec1 → rec1,
03600 dec2 → rec2,
03700 ...
03800 decn → recn;
03900
04000 ␈↓␈↓
04100 VARIABLES ␈↓
04200
04300 ␈↓ A␈α⊃function␈α⊃is␈α⊃difficult␈α⊃to␈α⊃define␈α⊃if␈α⊃every␈α∩case␈α⊃must
04400 be␈α∩enumerated.␈α∩ Therefore,␈α∪rewrite␈α∩rules␈α∩allow␈α∪variables␈α∩to
04500 appear␈α⊂in␈α⊂patterns.␈α⊂ The␈α⊂value␈α⊂of␈α⊂a␈α⊂variable␈α⊂can␈α⊂be␈α⊃either␈α⊂a
04600 list␈αor␈αan␈αatom.␈α In␈αthis␈αpaper␈αthe␈αnotation:
04700
04800 ␈↓∞ :X
04900
05000 ␈↓where␈α∀X␈α∪ia␈α∀any␈α∪identifier,␈α∀will␈α∪denote␈α∀the␈α∪variable␈α∀X.␈α∪ The
05100 variables␈α∩of␈α∩each␈α∪rule␈α∩are␈α∩distinct␈α∪from␈α∩the␈α∩variables␈α∪of␈α∩all
05200 other␈αrules,␈αeven␈αif␈αtheir␈αnames␈αare␈αthe␈αsame.
05300
05400 The␈αfollowing␈αdefinition␈αhas␈αonly␈αthree␈αrewrite␈αrules,
05500 but␈αhandles␈αan␈αunlimited␈αnumber␈αof␈αinput␈αstreams:
00100 ␈↓∞ RULES OF REPLY=
00200 HOW ARE YOU '? → VERY WELL '?
00300 HOW IS :X → I HAVEN''T SEEN :X ',
00400 LATELY'.,
00500 DID :X GO TO :Y '? →
00600 WHY DON''T YOU ASK :X YOURSELF'?;
00700
00800 ␈↓A␈α∞variable␈αcan␈α∞appear␈α∞more␈αthan␈α∞once␈α∞in␈αa␈α∞single␈α∞dec␈αpattern,
00900 but␈α≡it␈α≡must␈α≡match␈α≥identical␈α≡items␈α≡at␈α≡each␈α≥appearance.
01000 Example:
01100
01200 ␈↓∞ RULES OF EQUAL =
01300 (EQUAL :X :X) → TRUE;
01400
01500 ␈↓␈↓
01600 ELLIPSIS ␈↓
01700
01800 ␈↓ To␈α↔make␈α⊗patterns␈α↔easier␈α⊗to␈α↔read␈α⊗and␈α↔write,␈α⊗the
01900 ellipsis␈α_symbol␈α→...␈α_can␈α→be␈α_used␈α_to␈α→stand␈α_for␈α→an␈α_unnamed
02000 variable.␈α Thus:
02100
02200
02300 ␈↓∞ IS ... COMING → NO, ... COULD NOT MAKE IT.
02400
02500 ␈↓If␈αan␈αellipsis␈α(...)␈αoccurs␈αseveral␈αtimes␈αon␈αa␈αside,␈αit␈αdesignates␈αa
02600 different␈α~variable␈α~each␈α~time.␈α~ The␈α~n'th␈α~ellipsis␈α~in␈α≠a␈α~dec
02700 designates␈αthe␈αsame␈αvariable␈αas␈αthe␈αn'th␈αellipsis␈αin␈αthe␈αrec.
02800
02900 Ellipsis␈α_is␈α_one␈α_of␈α_the␈α_principle␈α_ideas␈α→of␈α_pattern
03000 matching.␈α It␈αpermits␈αimprecise␈αmathching;␈αthat␈αis,␈αthe␈αemphasis
03100 or␈αignoring␈αof␈αitems.
03200
03300 ␈↓
03400 AUTOMATIC ORDERING OF RULES ␈↓
03500
03600 ␈↓ The␈α⊂order␈α⊂of␈α⊂rules␈α⊂in␈α⊂a␈α⊂function␈α⊂definition␈α⊂does␈α∂not
03700 specify␈α⊃the␈α⊃order␈α⊃in␈α⊃which␈α⊃the␈α⊃system␈α⊃will␈α⊃attempt␈α⊃to␈α⊃apply
03800 them.␈α∂ This␈α∂ordering␈α∂operation␈α∞is␈α∂handled␈α∂by␈α∂a␈α∂special␈α∞system
03900 ordering␈αfunction.␈α Consider␈αthe␈αrewrite␈αfunction:
04000
04100 ␈↓∞ RULES OF REPLY =
04200 I SEE :X → SO WHAT '?,
04300 I SEE ANN → WOW '!;
04400
04500 ␈↓Both␈αrules␈αwould␈αmatch:
04600
04700 ␈↓∞ I SEE ANN
04800
04900 ␈↓In␈α∞such␈α∂cases␈α∞the␈α∂more␈α∞specific␈α∂rule␈α∞takes␈α∂precedence.␈α∞ Thus,
05000 given:
05100
05200 ␈↓∞ I SEE ANN
05300
05400 ␈↓as␈αthe␈αinput␈αstream␈α,␈αthe␈αoutput␈αstream␈αwould␈αbe:
05500
05600 ␈↓∞ WOW !
05700
05800 ␈↓but␈αgiven:
05900
06000 ␈↓∞ I SEE STARS
06100
06200 ␈↓the␈αoutput␈αstream␈αwould␈αbe:
00100 ␈↓∞ SO WHAT ?
00200
00300 ␈↓A␈αliteral␈αis␈α∞more␈αspecific␈αthan␈α∞a␈αvariable.␈αA␈α∞variable␈αappearing
00400 for␈αthe␈α∞second␈αtime␈α∞is␈αmore␈αspecific␈α∞than␈αa␈α∞variable␈αappearing
00500 for␈α∪the␈α∩first␈α∪time␈α∩in␈α∪a␈α∩dec.␈α∪ This␈α∩is␈α∪so␈α∩because␈α∪the␈α∩second
00600 occurence␈αof␈αthe␈αvariable␈αmust␈αmatch␈αthe␈αsame␈αpattern␈αas␈αthe
00700 first␈α∪occurence.␈α∩The␈α∪precedence␈α∩function␈α∪is␈α∩itself␈α∪written␈α∩in
00800 rewrites␈αand␈αso␈αis␈αboth␈αextendable␈αand␈αchangable␈αby␈αthe␈αuser.
00900 Currently␈α precedence␈α∨is␈α calculated␈α by␈α∨a␈α left␈α to␈α∨right
01000 application␈α⊗of␈α⊗the␈α⊗above␈α⊗criteria.␈α⊗ Therefore,␈α↔the␈α⊗following
01100 function␈αdefines␈αthe␈αLISP␈αfunction␈αEQUAL:
01200
01300 ␈↓∞ RULES OF EQUAL =
01400 (EQUAL :X :X) → T,
01500 (EQUAL :X :Y) → NIL;
01600
01700 ␈↓␈↓
01800 SEGMENTS ␈↓
01900
02000 ␈↓ Sometimes␈α⊗ it␈α↔ is␈α⊗ desirable␈α↔ for␈α⊗ a␈α↔ variable␈α⊗ to
02100 match␈α an␈αindeterminate␈αnumber␈αof␈αitems.␈α This␈αis␈αnotated:
02200
02300 ␈↓∞ ::X
02400
02500 ␈↓Use␈α∞of␈αthe␈α∞double-colon␈α∞("::")␈αmeans␈α∞that␈αthe␈α∞variable␈α∞(e.g.,␈αX)
02600 will␈αmatch␈αzero␈αor␈αmore␈αitems.␈α Example:
02700
02800 ␈↓∞ RULES OF APPEND=
02900 (APPEND (::X)(::Y)) → (::X ::Y);
03000
03100 ␈↓or␈αif␈αthe␈αinput␈αstream␈αwere:
03200
03300 ␈↓∞ (APPEND (A B) (C D E))
03400
03500 ␈↓the␈αoutput␈αstream␈αwould␈αbe:
03600
03700 ␈↓∞ (A B C D E)
03800
03900 ␈↓For␈αincreased␈αreadability␈αthe␈αrule␈αcould␈αalso␈αbe␈αwritten:
04000
04100 ␈↓∞ RULES OF APPEND =
04200 (APPEND (...) (...)) → (... ...);
04300
04400 ␈↓Another␈αexample:
04500
04600 ␈↓∞ RULES OF REPLY =
04700 WHERE DID ::X GO →
04800 ::X WENT HOME '.;
04900
05000 ␈↓Therefore,
05100
05200 ␈↓∞ WHERE DID THE CARPENTER GO →
05300 THE CARPENTER WENT HOME.
05400
05500 ␈↓␈↓
05600 APPLICATION ␈↓
05700
05800 ␈↓ One␈α∂of␈α∂the␈α∞main␈α∂deficiencies␈α∂of␈α∞the␈α∂system␈α∂in␈α∞which
05900 the␈α⊂MAD␈α⊂DOCTOR␈α⊂was␈α⊃programmed␈α⊂was␈α⊂its␈α⊂lack␈α⊃of␈α⊂adequat`α@
06000 subroutining␈α∩capability.␈α∩ Subroutines␈α∩may␈α∩be␈α∩indicated␈α∩in␈α∩the
06100 rewrite␈αsystem␈αas␈αfollows:
00100 ␈↓∞ RULES OF LAST =
00200 ␈↓∞ () → (),
00300 (:X) → :X,
00400 (:X ...) → <LAST (...)>;
00500
00600 ␈↓The␈α⊂"<>"␈α⊃surrounding␈α⊂a␈α⊃pattern␈α⊂means␈α⊃that␈α⊂the␈α⊃current␈α⊂input
00700 stream␈α⊃is␈α⊃to␈α⊃be␈α⊃pushed␈α⊂down,␈α⊃that␈α⊃the␈α⊃function␈α⊃indicated␈α⊂by
00800 the␈α∂firtoken␈α∂within␈α∂the␈α⊂brackets␈α∂is␈α∂to␈α∂be␈α∂entered␈α⊂with␈α∂the
00900 rest␈αof␈αthe␈αpattern␈αappended␈αto␈αthe␈αfront␈αof␈αthe␈α∞input␈αstream,
01000 and␈α∂that␈α⊂the␈α∂output␈α∂stream␈α⊂is␈α∂to␈α∂be␈α⊂placed␈α∂into␈α⊂the␈α∂restored
01100 current␈α↔input␈α_stream.␈α↔Note␈α↔that␈α_MLISP2␈α↔functions␈α_may␈α↔be
01200 called␈αas␈αwell␈αas␈αrewrite␈αfunctions.
01300
01400 ␈↓
01500 GOALS ␈↓
01600
01700 ␈↓ To␈α→gain␈α→the␈α~advantage␈α→of␈α→goal␈α~directed␈α→pattern
01800 matching␈α∂and␈α⊂computing,␈α∂as␈α⊂well␈α∂as␈α∂the␈α⊂full␈α∂power␈α⊂of␈α∂context
01900 sensitive␈αgrammars,␈αthe␈αfollowing␈αform␈αmay␈αbe␈αused:
02000
02100 ␈↓∞ RULES OF PREPOSITIONAL_PHRASE =
02200 <PREPOSITION>:P <NOUN_PHRASE>:N
02300 → (PREP_PH :P :N);
02400
02500 ␈↓The␈α∃identifer␈α∃between␈α∃the␈α∃angled␈α∃brackets␈α∃("<>")␈α∃names␈α∀a
02600 rewrite␈α∞function␈α∞the␈α∞rules␈α∞of␈α∞which␈α∞are␈α∞to␈α∞be␈α∞matched␈α∞against
02700 the␈α∞input␈α∂stream.␈α∞ When␈α∂a␈α∞match␈α∞occurs␈α∂the␈α∞output␈α∂stream␈α∞of
02800 the␈αgoal␈αwill␈αbe␈αbound␈αto␈αthe␈αassociated␈αvariable.␈αExample:
02900
03000 ␈↓∞ RULES OF PREPOSITIONAL_PHRASE =
03100 <PREPOSITION>:P <NOUN_PHRASE>:N
03200 → (PREP_PH :P :N);
03300
03400 RULES OF NOUN_PHRASE =
03500 TOWN → (NOUN_PH TOWN),
03600 PALO ALTO → (NOUN_PH PALO_ALTO);
03700
03800 RULES OF PREPOSITON =
03900 IN → IN,
04000 ON → ON;
04100
04200 ␈↓and␈αthe␈αinput␈αstream:
04300
04400 ␈↓∞ IN PALO ALTO
04500
04600 ␈↓the␈αoutput␈αstream␈αwould␈αbe:
04700
04800 ␈↓∞ (PREP_PH IN (NOUN_PH PALO_ALTO))
04900
05000 ␈↓␈↓
05100 OPTIONALS ␈↓
05200
05300 ␈↓ Many␈α≠other␈α~shorthands␈α≠exist␈α~to␈α≠simplify␈α~writing
05400 rules.␈α⊃One␈α⊃useful␈α⊃feature␈α⊃that␈α⊃will␈α⊃be␈α⊃mentioned␈α⊃here␈α⊃is␈α⊂the
05500 optional.
05600
05700 ␈↓∞ RULES OF AUXILARY_PHRASE =
05800 <AUXILARY>:A [<NEGATIVE>:N]:N1 →
05900 (AUX_PH :A [:N]:N1 );
06000
06100 ␈↓If␈α⊗the␈α⊗optional␈α⊗pattern,␈α⊗enclosed␈α⊗in␈α⊗square␈α↔brackets␈α⊗("[]"),
00100 occurs␈α∞in␈α∞the␈α∞input␈α∞stream␈α∞it␈α∂will␈α∞be␈α∞bound␈α∞to␈α∞:N.␈α∞ :N1␈α∂will␈α∞be
00200 ␈↓bound␈α⊃to␈α⊃2.␈α∩ If␈α⊃the␈α⊃<NEGATIVE>␈α⊃does␈α∩not␈α⊃occur,␈α⊃:N1␈α∩will␈α⊃be
00300 bound␈αto␈α1.␈αOn␈αthe␈αrec␈αside␈αof␈αthe␈αrules␈αif␈α:N1␈αis␈α2␈αthen␈α:N␈αwill
00400 be␈α∀placed␈α∀in␈α∀the␈α∀output␈α∀stream.␈α∀ If␈α∀it␈α∀is␈α∀1␈α∀then␈α∀nothing␈α∪is
00500 placed␈α∞in␈α∞the␈α∞output␈α∞stream␈αat␈α∞that␈α∞point.␈α∞ Example,␈α∞given␈αthe
00600 rule␈αabove:
00700
00800 ␈↓∞ DO → (AUX_PH DO)
00900 DO NOT → (AUX_PH DO NOT)
01000
01100 ␈↓␈↓
01200 MORE EXAMPLES ␈↓
01300
01400 ␈↓ We␈αhave␈αcollected␈αa␈αlarge␈αnumber␈αof␈αdialogues␈αusing
01500 our␈α∩previous␈α⊃program␈α∩PARRY.␈α⊃ These␈α∩dialogues␈α⊃form␈α∩a␈α⊃large
01600 body␈α∞of␈αexamples␈α∞of␈αthe␈α∞kind␈αof␈α∞English␈αwhich␈α∞we␈α∞can␈αexpect.
01700 Martin␈α∨Frost,␈α∨a␈α≡graduate␈α∨student␈α∨in␈α∨Computer␈α≡Science,
01800 Stanford␈α≥University,␈α≡ has␈α≥written␈α≡a␈α≥keyword␈α≡in␈α≥context
01900 program␈α∀which␈α∀enables␈α∀us␈α∪to␈α∀isolate␈α∀examples␈α∀centered␈α∪on
02000 particular␈α⊗words␈α⊗so␈α⊗that␈α∃uses␈α⊗of␈α⊗thoses␈α⊗words␈α⊗in␈α∃context
02100 become␈α∩more␈α∪apparent.␈α∩ Our␈α∪general␈α∩approach␈α∪is␈α∩to␈α∪build␈α∩a
02200 system␈α∀which␈α∀can␈α∀produce␈α∀desired␈α∀intreptations␈α∀from␈α∀these
02300 examples␈α∂and␈α∞to␈α∂incrementally␈α∂add␈α∞to␈α∂the␈α∞rules␈α∂in␈α∂the␈α∞system
02400 as␈α~new␈α~cases␈α→are␈α~discovered␈α~during␈α→the␈α~running␈α~of␈α→the
02500 program.
02600
02700 ␈↓ Following␈α!are␈α!some␈α examples␈α!of␈α!commonly␈α occuring
02800 situations␈α∞and␈αexamples␈α∞of␈α∞the␈αkind␈α∞of␈α∞rules␈αwe␈α∞use␈α∞to␈αhandle
02900 them.
03000
03100 ␈↓
03200 QUESTION INTRODUCER ␈↓
03300
03400 ␈↓ In␈α∃doctor-patient␈α⊗dialogues␈α∃it␈α⊗is␈α∃quite␈α⊗common␈α∃to
03500 introduce␈αa␈α∞question␈αby␈α∞the␈αuse␈αof␈α∞a␈αcommand.␈α∞ The␈α"question
03600 introducer"␈α↔is␈α⊗followed␈α↔by␈α⊗either␈α↔a␈α⊗<NOUN_PHRASE>␈α↔or␈α⊗a
03700 <DECLARATIVE_SENTENCE>.␈αFor␈αexample,
03800
03900 ␈↓∞ COULD YOU TELL ME YOUR NAME?
04000
04100 ␈↓Rather␈α⊃than␈α⊃attempt␈α⊂a␈α⊃literal␈α⊃analysis␈α⊂of␈α⊃this␈α⊃question,␈α⊂which
04200 might␈αlead␈αto␈αthe␈αinterpretation:
04300
04400 ␈↓∞ DO YOU HAVE THE ABILITY TO SPEAK YOUR NAME TO ME?
04500
04600 ␈↓we␈αutilize␈αrules␈αlike:
04700
04800 ␈↓∞ RULES OF SENTENCE =
04900 <QUESTION_INTRODUCER>:Q <NOUN_PHRASE>:N
05000 → (IS :N '*'?'* );
05100
05200 RULES OF QUESTION_INTRODUCER =
05300 COULD YOU TELL ME → ,
05400 WOULD YOU TELL ME → ,
05500 PLEASE TELL ME → ;
05600
05700 ␈↓ Although␈α⊂it␈α⊂is␈α⊂conceivable␈α∂that␈α⊂there␈α⊂are␈α⊂an␈α∂infinite
05800 number␈α∩of␈α∩ways␈α∩to␈α∩introduce␈α∩a␈α∩question␈α∩in␈α∩this␈α∩manner,␈α∩we
05900 have␈α⊂found␈α⊂only␈α⊂about␈α⊃six␈α⊂literal␈α⊂strings␈α⊂are␈α⊂actually␈α⊃used␈α⊂in
06000 our␈α⊂data␈α⊂base␈α⊂of␈α⊂dialogues.␈α∂ When␈α⊂we␈α⊂discover␈α⊂a␈α⊂new␈α∂string
06100 we␈αincrementally␈αadd␈αa␈αrule.␈α When␈αwe␈αhave␈αenough␈αexamples
00100 to␈α∃dectect␈α∃a␈α⊗more␈α∃general␈α∃form␈α⊗we␈α∃replace␈α∃the␈α⊗rules␈α∃for
00200 ␈↓<QUESTION_INTRODUCER>␈α~by␈α~a␈α~more␈α~elegant␈α~and␈α→general
00300 formulation.␈α⊗ This␈α⊗approach␈α⊗allows␈α⊗us␈α⊗to␈α↔process␈α⊗dialogues
00400 before␈α⊂we␈α⊂have␈α⊂a␈α⊂complete␈α⊂analysis␈α⊂of␈α⊂all␈α⊃possible␈α⊂sentence
00500 constructions,␈α∪and␈α∪it␈α∪allows␈α∪us␈α∪to␈α∪build␈α∪a␈α∀language␈α∪analyzer
00600 based␈αon␈αactually␈αoccurring␈αforms.
00700
00800 Notice␈α∀that␈α∀it␈α∀is␈α∪possible␈α∀to␈α∀make␈α∀more␈α∀than␈α∪one
00900 analysis␈α∪of␈α∀any␈α∪given␈α∀sentence␈α∪depending␈α∀on␈α∪what␈α∀is␈α∪being
01000 looked␈α⊗for.␈α↔A␈α⊗poet␈α↔might␈α⊗be␈α⊗interested␈α↔in␈α⊗the␈α↔number␈α⊗of
01100 syllables␈α→per␈α→word␈α→and␈α~the␈α→patterns␈α→of␈α→stress.␈α~ A␈α→"full"
01200 analysis␈α∪of␈α∪English␈α∀must␈α∪allow␈α∪for␈α∀this␈α∪possibility,␈α∪ but␈α∀it␈α∪it
01300 clearly␈αfoolish␈αto␈αproduce␈αthis␈αkind␈αof␈αanalysis␈αfor␈αPARRY.␈α Our
01400 analysis␈α⊃will␈α⊂be␈α⊃partial␈α⊂and␈α⊃idiosyncratic␈α⊂to␈α⊃the␈α⊂needs␈α⊃of␈α⊂our
01500 program.␈α This␈αis␈αwhat␈αis␈αmeant␈αby␈αidiolectic.
01600
01700 ␈↓
01800 FILLERS ␈↓
01900
02000 ␈↓ It␈α⊗is␈α↔quite␈α⊗common␈α↔for␈α⊗interviewers␈α↔to␈α⊗introduce
02100 words␈α∩of␈α∩little␈α∩significance␈α∩to␈α∩PARRY␈α∩into␈α∩the␈α∩sentence.␈α⊃For
02200 example:
02300
02400 ␈↓∞ WELL, WHAT IS YOUR NAME?
02500
02600 ␈↓The␈α∀"well"␈α∀in␈α∀this␈α∀sentence␈α∀serves␈α∀no␈α∀purpose␈α∃in␈α∀PARRY's
02700 analysis,␈α∞although␈α∞it␈α∞might␈α∞to␈α∞a␈α∞linguist␈α∞interested␈α∂in␈α∞hesitation
02800 phenomena.␈α∞ These␈α∞fillers␈α∞can␈α∞be␈α∞ignored.␈α∞ The␈α∞following␈αrules
02900 accomplish␈αthis:
03000
03100 ␈↓∞ RULES OF SENTENCE =
03200 <FILLERS>:F <SENTENCE>:S → :S;
03300
03400 RULES OF FILLERS =
03500 WELL → ,
03600 OK → ;
03700
03800 ␈↓␈↓
03900 PUNCTUATION ␈↓
04000
04100 ␈↓ Interviewers␈α∂use␈α∂little␈α∂intra-sentence␈α∂punctuation␈α∞in
04200 talking␈α∪to␈α∪PARRY.␈α∪ When␈α∪it␈α∪is␈α∪used␈α∪it␈α∪is␈α∪often␈α∀to␈α∪seperate
04300 phrases␈αthat␈αmight␈αotherwise␈αbe␈αambiguous.␈α Example:
04400
04500 ␈↓∞ WHY WEREN'T YOU VERY CLOSE, FRANK
04600
04700 ␈↓Here␈α∪the␈α∪comma␈α∪clearly␈α∩puts␈α∪"CLOSE"␈α∪in␈α∪a␈α∪different␈α∩phrase
04800 from␈α⊃"FRANK".␈α⊃Punctuation,␈α⊃ when␈α⊃used␈α⊃in␈α⊃PARRY's␈α⊃rules,␈α⊂ is
04900 generally␈α∀enclosed␈α∪in␈α∀optional␈α∀brackets␈α∪("[]").␈α∀ This␈α∀has␈α∪the
05000 effect␈αof␈αseperating␈αphrases␈αwhen␈αpunctuation␈αis␈αused,␈αbut␈αnot
05100 requiring␈αfull␈αpunctuation␈αfor␈αthe␈αsystem␈αto␈αwork.␈αExample:
05200
05300 ␈↓∞ RULES OF SENTENCE =
05400 <SENTENCE>:S1 [',]:C <SENTENCE_CONNECTOR>:SC
05500 <SENTENCE>:S2
05600 → (CONUNCTION :SC :S1 :S2);
05700
05800
05900 ␈↓␈↓
06000 CLICHES AND IDIOMS ␈↓
06100
06200 ␈↓ The␈α(English␈α)we␈α(encounter␈α)in␈α(doctor-patient
00100 dialogues␈α∞is␈α∞made␈α∂up␈α∞of␈α∞a␈α∞great␈α∂number␈α∞of␈α∞cliches␈α∂and␈α∞idioms,
00200 ␈↓therefore␈α⊃we␈α∩anticipate␈α⊃a␈α∩large␈α⊃number␈α∩of␈α⊃rules␈α∩devoted␈α⊃to
00300 them.␈α For␈αexample:
00400
00500 ␈↓∞ RULES OF TIME_PHRASES =
00600 A COUPLE OF <TIME_UNIT>:T AGO
00700 → (TIME (RELATIVE PAST)(REF PRESENT) :T);
00800
00900 RULES OF TIME_UNIT =
01000 SECONDS → (WITHIN CONVERSATION),
01100 MOMENTS → (WITHIN CONVERSATION),
01200 DAYS → (BEFORE CONVERSATION DAYS);
01300
01400 ␈↓␈↓
01500 REPRESENTATION CORRECTION ␈↓
01600
01700 ␈↓ Intermediate␈α∪results␈α∪are␈α∩often␈α∪produced␈α∪which␈α∩are
01800 misleading␈α∪in␈α∪meaning␈α∪or␈α∪are␈α∪in␈α∪the␈α∪wrong␈α∪form␈α∀for␈α∪further
01900 processing.␈α∂We,␈α∂therefore,␈α∞incorporate␈α∂at␈α∂various␈α∂points␈α∞rules
02000 which␈α≡detect␈α≥certain␈α≡undesired␈α≥intermediate␈α≡results␈α≥and
02100 convert␈αthem␈αto␈αthe␈αdesired␈αform.␈α Example:
02200
02300 ␈↓∞ RULES OF CORRECT_FORM =
02400 (QUESTION ... (SENTENCE ...)) →
02500 (QUESTION ... ...);
02600
02700 ␈↓␈↓
02800 UNKNOWN WORDS ␈↓
02900
03000 ␈↓ Rules␈α⊃can␈α⊃be␈α∩derived␈α⊃to␈α⊃handle␈α⊃words␈α∩which␈α⊃were
03100 previously␈αunknown␈αto␈αthe␈αsystem.␈α For␈αexample:
03200
03300 ␈↓∞ RULES OF UNKNOWN_WORD =
03400 DR'. :X → <NEW_WORD NAME :X>,
03500 THE :X <VERB_PHRASE>:V →
03600 <NEW_WORD NOUN :X>,
03700 I :X YOU → <NEW_WORD VERB :X>;
03800
03900 ␈↓Here␈α"NEW_WORD"␈αis␈αa␈αfunction␈αwhich␈αadds␈αnew␈αwords␈αto␈αthe
04000 dictionary.
04100
04200 ␈↓
04300 CONCLUSION ␈↓
04400
04500 ␈↓ We␈α∞are␈αfaced␈α∞with␈α∞the␈αproblems␈α∞of␈α∞natural␈αlanguage
04600 being␈α∩used␈α∩to␈α∩interview␈α∩people␈α∩in␈α∩a␈α∪doctor-patient␈α∩context.
04700 We␈α⊃have␈α⊂developed␈α⊃a␈α⊃language␈α⊂processing␈α⊃system␈α⊃which␈α⊂we
04800 believe␈α⊗is␈α⊗capable␈α∃of␈α⊗performing␈α⊗in␈α∃these␈α⊗interviews␈α⊗at␈α∃a
04900 significantly␈α≥improved␈α≥level␈α≤of␈α≥performance␈α≥compared␈α≤to
05000 systems␈α∀used␈α∀in␈α∀the␈α∀past.␈α∀ We␈α∀have␈α∃developed␈α∀techniques
05100 which␈α⊂can␈α⊃measure␈α⊂performance␈α⊂in␈α⊃comparison␈α⊂with␈α⊃the␈α⊂ideal
05200 of␈α⊗a␈α⊗real␈α↔human␈α⊗patient␈α⊗in␈α↔the␈α⊗same␈α⊗context.␈↓↓4,5,7␈↓␈α↔We␈α⊗are
05300 designing␈αour␈αsystem␈α∞with␈αthe␈αrealization␈α∞that␈αa␈αlong␈α∞period␈αof
05400 development␈α"is␈α"necessary␈α"to␈α"reach␈α"desired␈α"levels␈α!of
05500 performance.␈α⊂ This␈α⊂is␈α⊃a␈α⊂system␈α⊂that␈α⊃can␈α⊂work␈α⊂at␈α⊃a␈α⊂measured
05600 level␈α∪of␈α∪performance␈α∪and␈α∪be␈α∪improved␈α∪over␈α∪time␈α∀with␈α∪new
05700 rules␈α⊃having␈α⊃minimum␈α⊃interaction␈α⊃with␈α⊃those␈α∩already␈α⊃existing.
05800 Our␈α⊂system␈α⊂is␈α⊂designed␈α⊂so␈α∂that␈α⊂a␈α⊂complete␈α⊂analysis␈α⊂of␈α∂every
05900 word␈αor␈αphrase␈αof␈αan␈αutterance␈αis␈αnot␈αneceesary.
06000
06100 ␈↓ The␈α∀basis␈α∀of␈α∀this␈α∀system␈α∀is␈α∀a␈α∀rewrite␈α∪interpreter
06200 which␈α↔will␈α↔automatically␈α↔merge␈α↔new␈α↔rules␈α↔into␈α↔the␈α_set␈α↔of
00100 already␈α⊗existing␈α∃rules␈α⊗so␈α∃that␈α⊗the␈α∃system␈α⊗will␈α⊗continue␈α∃to
00200 ␈↓handle␈αsentences␈αwhich␈αit␈αhandled␈αin␈αthe␈αpast.
00300
00400 ␈↓ α>␈↓
00500 REFERENCES␈↓
00600
00700 ␈↓␈↓↓1␈↓Carroll,␈α⊃J.B.␈α⊃Language␈α⊃and␈α⊃Thought.␈α⊃Prentice-Hall,␈α⊂Englewood
00800 Cliffs,␈α New␈αJersey,␈αp.␈α59.
00900
01000 ␈↓↓2␈↓Colby,␈α K.M.,␈α Watt,␈αJ.␈αand␈α Gilbert,␈αJ.P.␈α A␈α computer␈α method
01100 of␈α⊃psychotherapy.␈α∩Journal␈α⊃of␈α∩Nervous␈α⊃and␈α∩Mental␈α⊃Disease,
01200 142,148-152,1966.
01300
01400 ␈↓↓3␈↓Colby,K.M.␈α∨and␈α∨Enea,H.␈α∨Heuristic␈α∨methods␈α for␈α∨computer
01500 understanding␈α∩ of␈α∩natural␈α∩language␈α∩in␈α∪context␈α∩restricted
01600 on-line␈αdialogues.␈α Mathematical␈αBiosciences,1,1-25,1967.
01700
01800 ␈↓↓4␈↓Colby,␈α⊂K.M.,␈α⊃Hilf,␈α⊂F.D.,␈α⊃Weber,␈α⊂S.,␈α⊃and␈α⊂Kraener,␈α⊃H.␈α⊂Turing-like
01900 indistinguishability␈α∃tests␈α∃for␈α∀the␈α∃validation␈α∃of␈α∃a␈α∀computer
02000 simulation␈α?␈α⊂of␈α?␈α⊂paranoid␈α?␈α⊂processes.␈α?␈α⊂Artificial
02100 Intelligence,3,199-221,1972.
02200
02300 ␈↓↓5␈↓Colby,␈α!K.M.␈α!and␈α"Hilf,␈α!F.D.␈α!Multidimensional␈α"analysis␈α!in
02400 evaluating␈α≥the␈α≤adequacy␈α≥of␈α≤a␈α≥simulation␈α≥of␈α≤paparnoid
02500 processes.␈α∪ Memo␈α∀AIM-194.␈α∪Stanford␈α∀Artificial␈α∪Intelligence
02600 project,␈αStanford␈αUniversity.
02700
02800 ␈↓↓6␈↓Enea,␈α⊂H.␈α∂MLISP,␈α⊂Technical␈α∂report␈α⊂no.␈α∂CS-92,␈α⊂1968,␈α∂Computer
02900 Science␈αDepartment,␈αStanford␈αUniversity.
03000
03100 ␈↓↓7␈↓Rubenstein,␈α↔A.H.␈α↔and␈α↔Haberstroh,␈α⊗C.␈α↔J.,␈α↔Some␈α↔Theories␈α⊗of
03200 Organization,␈αDorsey␈αPress,␈αHomewood␈αIll.,1960,␈αp.␈α232.
03300
03400 ␈↓↓8␈↓Schank,␈α∩R.C.,␈α∩Tesler,␈α∪L.␈α∩and␈α∩Weber,S.␈α∩Spinoza␈α∪ii:␈α∩Conceptual
03500 case-based␈αnatural␈αlanguage␈αanalysis.␈αMemo␈αAIM-109,␈α1970,
03600 Stanford␈αArtificial␈αIntelligence␈αProject,␈αStanford␈αUniversity.
03700
03800 ␈↓↓9␈↓Simmons,␈α~R.F.␈α→Some␈α~semantic␈α→structures␈α~for␈α→representing
03900 English␈α&meanings.␈α&Preprint,␈α&1970,␈α'Computer␈α&Science
04000 Department,␈αUniversity␈αof␈αTexas,␈αAustin.
04100
04200 ␈↓↓10␈↓Smith,␈α⊂D.C.,␈α⊂MLISP,␈α∂Memo␈α⊂AIM-135,␈α⊂1970,␈α⊂Stanford␈α∂Artificial
04300 Intelligence␈αProject,␈αStanford␈αUniversity.
04400
04500 ␈↓↓11␈↓Weizenbaum,␈α∂J.␈α∂Eliza-␈α∂a␈α∂computer␈α∂program␈α∂for␈α∂the␈α⊂study␈α∂of
04600 natural␈α-communication␈α.between␈α-man␈α.and␈α-machine.
04700 Communications␈αof␈αthe␈αACM,␈α9,36-45,1966.
04800
04900 ␈↓↓12␈↓Wilks,␈αY.A.␈αUnderstanding␈αwithout␈αproofs.␈α(See␈αthis␈αvolume).
05000
05100 ␈↓↓13␈↓Winograd,␈α⊃T.␈α⊂A␈α⊃program␈α⊂for␈α⊃understanding␈α⊃natural␈α⊂language.
05200 Cognitive␈αPsychology,3,1-191,1972.
05300
05400 ␈↓↓14␈↓Woods,␈α∨W.A.␈α∨Transition␈α∨network␈α∨grammars␈α∨for␈α≡natural
05500 language␈α≠analysis.␈α≠ Communications␈α≠of␈α≠the␈α≠ACM,13,591-
05600 606,1970.